Rapid online adaptation using speaker space model evolution
Abstract
This paper presents a new approach to online adaptation of continuous density hidden Markov models (CDHMMs) from a small amount of adaptation data, based on speaker space model (SSM) evolution. The SSM, which characterizes the a priori knowledge of the training speakers, is described in terms of latent variable models such as factor analysis or probabilistic principal component analysis. The SSM provides several sources of information that are useful for rapid online adaptation: the correlation information, the prior density, and the prior knowledge of the CDHMM parameters. We design the SSM evolution based on the quasi-Bayes estimation technique, which incrementally updates the hyperparameters of the SSM and the CDHMM parameters simultaneously. In a series of speaker adaptation experiments on continuous digit and large vocabulary recognition tasks, we demonstrate that the proposed approach not only achieves good performance with a small amount of adaptation data but also maintains good asymptotic convergence as the amount of data increases. © 2004 Elsevier B.V. All rights reserved.
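To make the idea of a latent-variable speaker space concrete, here is a minimal sketch under simplifying assumptions that are not taken from the paper: each speaker is represented by a single mean supervector, the space is fitted with closed-form PPCA, and each adaptation step delivers one noisy observation of the new speaker's supervector. In that linear-Gaussian setting the posterior over the speaker's latent coordinates can be updated exactly and recursively, which stands in here for the quasi-Bayes approximation named in the abstract. The names `fit_ppca`, `OnlineSpeakerPosterior`, and `demo`, and all dimensions and noise levels, are hypothetical.

```python
import numpy as np


def fit_ppca(supervectors, q):
    """Closed-form maximum-likelihood PPCA fit on training-speaker supervectors.

    supervectors: (n_speakers, d) array; q: latent dimension (q < d).
    Returns the mean, the (d, q) loading matrix W, and the noise variance.
    """
    mu = supervectors.mean(axis=0)
    centered = supervectors - mu
    n, d = centered.shape
    # Eigenvalues/eigenvectors of the sample covariance via SVD.
    _, s, vt = np.linalg.svd(centered, full_matrices=False)
    eigvals = np.zeros(d)
    eigvals[: len(s)] = s**2 / n
    # ML noise variance is the mean of the discarded eigenvalues.
    sigma2 = eigvals[q:].mean()
    W = vt[:q].T * np.sqrt(np.maximum(eigvals[:q] - sigma2, 0.0))
    return mu, W, sigma2


class OnlineSpeakerPosterior:
    """Recursive Gaussian posterior over latent speaker coordinates x.

    Assumed model: y = mu + W x + noise, x ~ N(0, I), noise ~ N(0, sigma2 I).
    The posterior is stored in information form (precision, h).
    """

    def __init__(self, mu, W, sigma2):
        self.mu, self.W, self.sigma2 = mu, W, sigma2
        q = W.shape[1]
        self.precision = np.eye(q)  # prior precision of x
        self.h = np.zeros(q)        # prior precision-weighted mean

    def update(self, y):
        """Absorb one noisy observation y of the speaker's supervector."""
        self.precision += self.W.T @ self.W / self.sigma2
        self.h += self.W.T @ (y - self.mu) / self.sigma2

    def adapted_mean(self):
        """Posterior-mean estimate of the speaker's supervector."""
        x = np.linalg.solve(self.precision, self.h)
        return self.mu + self.W @ x


def demo(seed=0):
    """Synthetic check: error of the adapted mean after 0..20 observations."""
    rng = np.random.default_rng(seed)
    d, q, n_train, noise = 20, 3, 200, 0.1
    W_true = rng.normal(size=(d, q))
    mu_true = rng.normal(size=d)
    train = (mu_true + rng.normal(size=(n_train, q)) @ W_true.T
             + noise * rng.normal(size=(n_train, d)))
    mu, W, sigma2 = fit_ppca(train, q)
    target = mu_true + W_true @ np.array([1.5, -1.0, 0.5])
    post = OnlineSpeakerPosterior(mu, W, sigma2)
    errors = [np.linalg.norm(post.adapted_mean() - target)]
    for _ in range(20):
        post.update(target + noise * rng.normal(size=d))
        errors.append(np.linalg.norm(post.adapted_mean() - target))
    return errors
```

Calling `demo()` returns the distance between the adapted mean and the true speaker mean before adaptation and after each observation. Because the loading matrix `W` encodes correlations learned from the training speakers, a single observation moves every dimension of the mean at once, which is the intuition behind adapting rapidly from little data.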
Related articles
Online Adaptation of Continuous Density Hidden Markov Models Based on Speaker Space Model Evolution
In this paper, we propose a new approach to online adaptation of continuous density hidden Markov models (CDHMMs) based on speaker space model evolution. The speaker space model, which characterizes the a priori knowledge of the training speakers, is described in terms of a latent variable model such as factor analysis (FA) or probabilistic principal component analysis (PPCA). The...
Rapid speaker adaptation by reference model interpolation
We present in this work a novel algorithm for fast speaker adaptation using only small amounts of adaptation data. It is motivated by the fact that a set of representative speakers can provide a priori knowledge to guide the estimation of a new speaker in the speaker-space. The proposed algorithm enables an a posteriori selection of reference models in the speaker-space as opposed to the a prior...
Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation
A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...
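The remark that MAP estimation only updates models with available data can be illustrated with a minimal sketch of the standard MAP update for Gaussian means. This is not code from the cited article: the function name `map_adapt_means` and the prior weight `tau` are hypothetical, the frame-to-Gaussian occupation probabilities are assumed to be given, and MLLR is not covered here.

```python
import numpy as np


def map_adapt_means(prior_means, frames, posteriors, tau=10.0):
    """MAP update of Gaussian means with a prior centred on prior_means.

    prior_means: (K, D) speaker-independent means.
    frames:      (T, D) adaptation feature vectors.
    posteriors:  (T, K) occupation probability of each Gaussian per frame.
    tau:         assumed prior weight; larger values trust the prior more.
    """
    occupancy = posteriors.sum(axis=0)      # (K,) soft frame counts
    weighted_sum = posteriors.T @ frames    # (K, D) occupancy-weighted sums
    # Interpolate between the prior mean and the data, per Gaussian.
    return (tau * prior_means + weighted_sum) / (tau + occupancy)[:, None]


# Ten frames, all assigned to Gaussian 0; Gaussians 1 and 2 see no data.
prior = np.zeros((3, 2))
frames = np.ones((10, 2))
post = np.zeros((10, 3))
post[:, 0] = 1.0
adapted = map_adapt_means(prior, frames, post, tau=10.0)
```

In this toy case the mean of Gaussian 0 moves halfway toward the data, while the two Gaussians with zero occupancy stay exactly at their prior means, which is why MAP alone needs a lot of adaptation data to improve every model.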
Speaker adaptation by modeling the speaker variation in a continuous speech recognition system
A method for unsupervised instantaneous speaker adaptation is presented and evaluated on a continuous speech recognition task in a man-machine dialogue system. The method is based on modeling of the systematic speaker variation. The variation is modeled by a low-dimensional speaker space and the classification of speech segments is conditioned by the position in the speaker space. Because the e...
Journal title:
- Speech Communication
Volume: 42
Publication date: 2004